Reconstruction of Ancestral Gene Order Following Large Scale Genome Duplication and Gene Loss
نویسندگان
چکیده
Gene order evolves through gross chromosomal rearrangements, small scale inversions and transpositions, gene duplication, and gene loss. Much research has been done on the calculation of edit distance and on sorting algorithms under a variety of rearrangement models in which the genome may be represented as conserved segments with permuted order and orientation. However, gene loss within otherwise conserved segments, as typically occurs following large scale genome duplication, has not been well studied algorithmically. This has been a major impediment to comparative genomics in certain taxa, such as plants and fish. When large scale genome duplication and gene loss are occurring, how well can we infer both the true gene order within ancestral chromosomal segments and the ancestral ordering of those segments? We propose a heuristic algorithm for the inference of ancestral gene order in a set of genomes for which at least some genomic segments are partially related by common ancestry to two or more different segments. It does not require gene content and order to be perfectly conserved among segments. First, conserved chromosomal regions are identified using existing pairwise genomic alignment algorithms. Second, segments are iteratively clustered under the control of two parameters, (1) the minimal required number of shared genes between two segments or clusters and (2) the maximal allowed number of rearrangement breakpoints along the lineage leading to each descendant segment. Finally, we compute the estimated ancestral gene order for each cluster. We evaluate the performance of this algorithm on simulated data that models a genome evolving by large-scale duplication, duplicate gene loss, transposition, translocation, and inversion. The results suggest that ancestral gene orders may be estimated with sufficient accuracy to substantially improve the detection sensitivity of pairwise genomic alignment algorithms.
منابع مشابه
Reconstruction of Ancestral Gene Order after Segmental Duplication and Gene Loss
As gene order evolves through a variety of chromosomal rearrangements, conserved segments provide important insight into evolutionary relationships and functional roles of genes. However, gene loss within otherwise conserved segments, as typically occurs following large-scale genome duplication, has received limited algorithmic study. This has been a major impediment to comparative genomics in ...
متن کاملGene Family: Structure, Organization and Evolution
Gene families are considered as groups of homologous genes which they share very similar sequences and they may have identical functions. Members of gene families may be found in tandem repeats or interspersed through the genome. These sequences are copies of the ancestral genes which have underwent changes. The multiple copies of each gene in a family were constructed based on gene duplicati...
متن کاملGene Loss under Neighborhood Selection Following Whole genome Duplication and the Reconstruction of the Ancestral Populus genome
We develop criteria to detect neighborhood selection effects on gene loss following whole genome duplication, and apply them to the recently sequenced poplar (Populus trichocarpa) genome. We improve on guided genome halving algorithms so that several thousand gene sets, each containing two paralogs in the descendant T of the doubling event and their single ortholog from an undoubled reference g...
متن کاملBioinformatics Genome-Wide Characterization of the WRKY Gene Family in Sorghum bicolor
The WRKY gene family encodes a large group of transcription factors that regulate genes involved in plant response to biotic and abiotic stresses. Sorghum is a notable grain and forage crop in semi-arid regions because of its unusual tolerance against hot and dry environments. We identified a set of 85 WRKY genes in the S. bicolor genome and classified them into three groups (I–III). Among the ...
متن کاملCoevolution of gene families in prokaryotes.
We study gene family coevolution on a tree of life based on a large-scale ancestral gene content reconstruction, which includes gene duplication and deletion events. The insights obtained from this study are threefold: (1) Global properties, such as the distribution of coevolution partners and the formation of disconnected clusters of coevolving families, can be an inevitable consequence of evo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003